Randomization Techniques for Graphs
نویسندگان
چکیده
Mining graph data is an active research area. Several data mining methods and algorithms have been proposed to identify structures from graphs; still, the evaluation of those results is lacking. Within the framework of statistical hypothesis testing, we focus in this paper on randomization techniques for unweighted undirected graphs. Randomization is an important approach to assess the statistical significance of data mining results. Given an input graph, our randomization method will sample data from the class of graphs that share certain structural properties with the input graph. Here we describe three alternative algorithms based on local edge swapping and Metropolis sampling. We test our framework with various graph data sets and mining algorithms for two applications, namely graph clustering and frequent subgraph mining.
منابع مشابه
Randomization Techniques for Statistical Signi cance Testing on Graphs
Studying the patterns and properties of graph data is important in many application areas. A crucial question remains still largely ignored: how signi cant are the data mining results found on the graph data? Currently, the results are mostly justi ed by the optimal or near optimal value of the de ned objective function. We study randomization techniques for testing the statistical signi cance ...
متن کاملSequential Monte Carlo for counting vertex covers in general graphs
In this paper we describe a Sequential Importance Sampling (SIS) procedure for counting the number of vertex covers in general graphs. The optimal SIS proposal distribution is the uniform over a suitably restricted set, but is not implementable. We will consider two proposal distributions as approximations to the optimal. Both proposals are based on randomization techniques. The first randomiza...
متن کاملUncountable graphs and invariant measures on the set of universal countable graphs
We give new examples and describe the complete lists of all measures on the set of countable homogeneous universal graphs and Ksfree homogeneous universal graphs (for s ≥ 3) that are invariant with respect to the group of all permutations of the vertices. Such measures can be regarded as random graphs (respectively, random Ks-free graphs). The well-known example of Erdös–Rényi (ER) of the rando...
متن کاملComparing Random-Based and k-Anonymity-Based Algorithms for Graph Anonymization
Recently, several anonymization algorithms have appeared for privacy preservation on graphs. Some of them are based on randomization techniques and on k-anonymity concepts. We can use both of them to obtain an anonymized graph with a given k-anonymity value. In this paper we compare algorithms based on both techniques in order to obtain an anonymized graph with a desired k-anonymity value. We w...
متن کاملConstructing vertex decomposable graphs
Recently, some techniques such as adding whiskers and attaching graphs to vertices of a given graph, have been proposed for constructing a new vertex decomposable graph. In this paper, we present a new method for constructing vertex decomposable graphs. Then we use this construction to generalize the result due to Cook and Nagel.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009